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(54) Processor subsystem for use with a universal computer architecture 



(57) A processor subsystem includes a processor 
and a bus bridge conversion device for insertion into a 
slot of a host computer system. The processor operates 
according to a signaling protocol which is different than 
the signaling protocol of the computer system bus. The 
bus conversion device converts the signaling protocol 



of the system bus to the signaling protocol of the proc- 
essor, and vice-versa. The bus conversion device in- 
cludes logic for bus arbitration conversion, bus lock con- 
version, and cache coherency control. Logic is also in- 
cluded that converts incoming and outgoing requests so 
that the card may properly transact with other agents 
coupled to the bus. 
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Description 

FiELD OF THE INVENTION 

The present invention relates generally to the field 
of high-performance computer systems: particularly to 
signaling protocols and operational requirements asso- 
ciated with buses for use with advanced microproces- 
sors. 

BACKGROUND OF THE INVENTION 

The development of ever more advanced micro- 
processors and associated bus architectures continues 
at a rapid pace. For example, Intel Corporation of Santa 
Clara, California, recently introduced their latest proc- 
essor, the Pentium Pro™ at a time when the market de- 
mand for the previous generation Pentium® processor 
was still expanding. The Pentium Pro™ processor offers 
a great boost in performance over the previous gener- 
ation Pentium® processor and it introduces a new high- 
performance, pipelined system bus architecture. 

A dilemma arises for computer system developers 
who wish to design systems based on the latest bus ar- 
chitecture, but who still wish to accommodate processor 
designs based upon previous generation processors, or 
processors having alternate bus architectures. For in- 
stance, many system designers would like to design 
their computer systems so as to be compatible with In- 
tel's new Pentium Pro™ bus - to take advantage of 
state-of-the-art features - but who still must meet the 
current market demand for compatibility with the Pen- 
tium® processor architecture. This generally means that 
the system processor must have the same pinout and/ 
or operate in accordance with the bus architecture de- 
signed for the Pentiunn® processor. 

Unfortunately, the bus signaling protocol of the Pen- 
tium Pro™ processor differs significantly from the bus 
signaling protocol of the Pentiunn® processor. As a re- 
sult, system developers have been left with the unsavo- 
ry choice of either maintaining compatibility with main- 
stream processor designs - while foregoing the advan- 
tages associated with a state-of-the-art bus architecture 
- or, designing their system in accordance with the sig- 
naling protocol of the advanced bus architecture of the 
Pentium Pro™ processor - at the expense of reduced 
market acceptability due to the broader customer base 
for Pentium® processor-based systems. Therefore, 
what is needed, is a computer system architecture 
which is adapted for use with a variety of processor 
types, with each processor type potentially employing 
an alternate bus architecture. 

As will be seen, the present invention overcomes 
the problems inherent in the prior art by providing a proc- 
essor subsystem in the form of a plug-in card or module 
coupled to a host through a standardized computer bus 
interface. The invention offers compatibility with a wide 
variety of processor types while obviating the need to 



modify basic system architecture. The present invention 
advantageously allows a previous Generation proces- 
sor (e.g.. a Pentium® processor) to transfer information 
to/from other agents coupled to the high-performance 
s (i.e., Pentium Pro™ processor) system bus in a seam- 
less manner 

SUMMARY OF THE INVENTION 

^0 The present invention covers a processor subsys- 
tem which includes a processor and a bus conversion 
device for use with a host computer system that sup- 
ports a standardized system bus. A standardized bus 
interface provides connection to the system bus and 

'5 permits communication between the processor subsys- 
tem and the host. The host system includes components 
such as a host processor, a main memory, periphery de- 
vices, etc. The processor within the subsystem operates 
in accordance with a specific bus interface that differs 

20 from the standardized bus Interface of the host system. 
The bus conversion device provides conversion be- 
tween the standardized bus interface of the host system 
and the specific bus interface of the processor 

In one embodiment, the processor subsystem is 

25 housed on a card that may be plugged or inserted into 
a slot of the host computer system. The card includes 
the processor subsystem having a processor that oper- 
ates according to a specific signaling protocol, which is 
different than the signaling protocol of the system bus. 

30 A bus conversion device is coupled to the pins of the 
processor and the standardized bus interface of the 
card, which connects to the system bus. The bus con- 
version device converts the signaling protocol of the 
system bus to the signaling protocol of the processor 

35 subsystem, and vice-versa. 

The host computer system, which interfaces to the 
processor subsystem, may support a variety of autono- 
mous agents. These autonomous agents may comprise 
additional processors, various well-known devices such 

40 as direct-memory access (DMA) devices, or other sub- 
systems. These agents communicate with one another 
via a pipelined bus architecture in one implementation. 
Agents coupled to the pipelined bus are classified as 
either symmetric agents or priority agents. The symmet- 
ric agents support a distributed arbitration scheme ac- 
cording to a circular priority algorithm. 

The processor subsystem is classified as a sym- 
metric agent on the pipelined bus and requests owner- 
ship of the bus on behalf of the processor in accordance 

so with the circular priority algorithm. In a specific imple- 
mentation, a round-robin algorithm may be employed. 
In most cases, priority agents may gain immediate own- 
ership of the bus over any symmetric agent by sending 
a priority request signal on the bus. 

55 

BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention will be understood more fully 
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from the detailed description which follows and from the 
accompanying drawings, which, however, should not be 
taken to limit the invention to the specific embodiments 
shown, but rather are for explanation and understanding 
only. 

Figure 1 illustrates the pinout of the Pentium® proc- 
essor. 

Figure 2 is a high-level, block diagram of one em- 
bodiment of the present invention. 

. Figure 3 shows an emtx)diment of the invention in 
a computer system which includes multiple agents cou- 
pled via a high-performance bus. 

Figure 4 is a conceptual block diagram of one em- 
bodiment of the bus conversion device shown In Figure 
2. 

Figure 5 illustrates one embodiment of the arbitra- 
tion converter circuit shown in Figure 4. 

Figure 6 Is a block diagram of one embodiment of 
the outgoing request converter circuit shown in Figure 4. 

Figure 7 shows an embodiment of the bus lock con- 
verter illustrated in Figure 4. 

Figure 8 shows an embodiment of the incoming re- 
quest converter of Figure 4. 

Figure 9 is a block diagram of one embodiment of 
the cache coherency control unit shown in Figure 4. 

Figure 10 illustrates the bus request protocol con- 
version logic unit utilized in one embodiment of the 
present invention. 

DETAILED DESCRIPTION 

A processor subsystem for a universal computer ar- 
chitecture is described that includes a processor housed 
on a plug-in card or module along with a bus conversion 
device that converts the signaling protocol of the proc- 
essor to the signaling protocol of a high-performance 
system bus. In the following description, numerous spe- 
cific details are set forth, such as agent types, logic 
blocks, signal connections, etc . in order to provide a 
thorough understanding of the present invention. Prac- 
titioners having ordinary skill in the data processing arts 
will understand that the invention may be practiced with- 
out many of these specific details. In other instances, 
well-known signal timings, components, and circuits 
have not been described in detail to avoid obscuring the 
invention. 

Those skilled in the computer arts will further appre- 
ciate that the description that follows includes various 
terms which have specific, definite meaning within the 
relevant field. For instance, the term "standardized bus 
interface" is known by skilled practitioners to mean that 
the signal transmission between an agent and a bus fol- 
lows a commonly used signaling protocol that applies 
throughout the system. Furthermore, the standardized 
bus interface does not necessarily refer to an industry 
standard of the type promoted by various professional 
organizations. Rather the interface may be selected 
based on any one of a number of criteria. By way of ex- 



ample, an embodiment of the present invention is dis- 
closed in which the standardized bus interface is based 
upon a high-performance bus architecture that provides 
backward compatibility with prior processor genera- 

5 tions. In other instances, the standardized bus interface 
may be based upon a relatively simple bus architecture 
to reduce cost or to support a higher volume market with 
reduced component complexity. 

Figure 1 illustrates the signal interface of Intel's 

10 Pentium® microprocessor. By way of background, a de- 
tailed explanation of the operation and functions of the 
signal shown in Figure 1, as well as the bus signaling 
protocol associated with the Pentium® microprocessor, 
can be found in numerous publications. For instance, 

15 the book entitled, "Pentium Processor System Architec- 
ture, Second Edition", by Don Anderson and Tom 
Shanley. Addison-Wesley Publishing Company (1995) 
contains such a description. 

Although the following discussion refers to the Pen- 

20 tlum® microprocessor in exemplary embodiments of the 
present invention, it should be understood that there 
presently exist - or will exist in the future - other micro- 
processors that employ alternate intemal architectures 
while remaining pin-compatible with the Pentium® mi- 

25 croprocessor. Similarly, other processors provide sig- 
nals having similar functions and have been designed 
with a different pinout, but which maintain compatibility 
with the bus architecture defined for the Pentium® mi- 
croprocessor. 

30 Therefore, reference to the Pentium® microproces- 
sor in the present application should be taken to Include 
these equivalents. To put it another way, the present in- 
vention is not limited to the Pentium® microprocessor; 
rather, the invention may be practiced with any of a va- 

35 riety of data processing devices. In addition, while the 
present invention is described in the context of a host 
system based upon the Pentium Pro™ processor archi- 
tecture, those skilled in the art should understand that 
processors having different architectures may be readily 

40 adapted to take advantage of the benefits offered by the 
present invention. 

Referring now to Figure 2, there is shown a high- 
level, block diagram of one embodiment of the present 
invention. In the embodiment of Figure 2, a plurality of 

4S integrated circuit devices is mounted or housed on a 
printed circuit board (PCB) card (or module) 200. Card 
200 includes a processor 11 coupled to one or more 
cache memories 12 along lines 17. Also Included on 
card 200 is a bus conversion device 1 5 that is connected 

50 with processor 11 via lines 18. Bus conversion device 
15, in turn, is connected to system bus 101. In the de- 
scribed embodiment, system bus 1 01 comprises a high- 
performance pipelined bus having a standardized bus 
interface. 

55 Data, address and control lines 19. 13 and 16, re- 
spectively, may be connected directly from processor 11 
to system bus 101 (as shown), or alternatively coupled 
through conversion device 15. 
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As will be described In more detail shortly, bus con- 
version device 1 5 functions to convert the bus signaling 
protocol and electrical characteristics associated with 
processor 11 to the signal protocol and electrical char- 
acteristics associated with system bus 101, and vice- 
versa. In this way, information may be transferred be- 
tween the processor and other agents coupled to sys- 
tem bus 101. (Note that signal lines 14 are intended to 
represent there may be other inputs/output lines to proc- 
essor 11 coupled to other devices included on card 200. 
For example, lines 14 may include clock input signals, 
operating potential supply lines, connections to addi- 
tional integrated circuits, etc.) 

With reference now to Figure 3, there is shown an 
embodiment of the present invention in a computer sys- 
tem which includes multiple agents coupled to a high- 
performance bus 101 . In the embodiment of Figure 3, 
bus 101 comprises a pipelined bus having an architec- 
ture such as that designed for the Pentium Pro^^ proc- 
essor. Processor card 200 Is Illustrated as having a plu- 
rality of interface terminals disposed along one edge. 
These terminals plug into a mated connector 201 having 
a slot with a set of corresponding terminal connections. 
For example, this configuration may represent a daugh- 
tercard that connects to a motherboard via an available 
slot within the computer system. This structure permits 
high-performance bus 101 to interface to a number of 
processor cards, each of which may Include any of a 
variety of different processors. 

As can be seen, processor card 200 comprises a 
processor (P^^) 11 coupled with cache memories 12a- 
1 2c, and also interconnected to bus 1 01 via bus conver- 
sion device 15. Other devices coupled to bus 101 in- 
clude bus master 205, direct memory access (DMA) de- 
vice 202, I/O device(s) 207, expansion devices 203 and 
a group of processors (Pq) organized into a cluster 206. 
By way of example, processors Pg may comprise Pen- 
tium Pro^^ processors compatible with the bus architec- 
ture of pipelined bus 101. Thus, a multiprocessor (MP) 
system is illustrated in the computer system of Figure 3. 

For purposes of the following discussion, high-per- 
formance bus 101 may be taken to designate the Pen- 
tium Pro^" processor bus of Intel Corporation. The sig- 
naling protocol associated with the Pentium Pro^^ proc- 
essor bus is set forth in the publication, "Pentium Pro™ 
Processor Family Developer's Manual, Volume 1 : Spec- 
ifications" (1 996), which publication is available from In- 
tel Corporation by calling 1-800-879-4683, and which 
publication is herein Incorporated by reference. To bet- 
ter understand certain aspects of the present invention, 
however, various attributes of the Pentium Pro^" proc- 
essor bus will be discussed as they pertain to the inven- 
tion. 

To begin with, it should be understood that accord- 
ing to the bus arbitration protocol of the Pentium Pro"^*^ 
processor bus, there are two classes of bus agents: 
symmetric agents and priority agents. Symmetric 
agents support fair, distributed arbitration using a round- 



robin (i.e., circular priority) algorithm. Each symmetric 
agent has a unique agent identification (ID) that is as- 
signed at RESET: a rotating ID value that reflects the 
symmetric agent with the lowest priority for the next ar- 

s bitration event; and a symmetric ownership state indi- 
cator that indicates either a "busy" or an "idle" state. In 
the embodiment of Figure 3, for instance, processor 
card 200 appears as one of several symmetric agents 
coupled to bus 101. 

^0 In a typical system configuration, such as that 
shown in Figure 3. symmetric agents may include de- 
vices such as processors Pg, organized individually or 
into a cluster 206. bus master 205. and various expan- 
sion devices 203, shown coupled to bus 1 01 . In the com- 

^5 puter system of Figure 3, direct memory access (DMA) 
device 202 may function as a high priority agent since 
It normally requires a direct path to memory. In other 
configurations, the priority agent(s) may comprise a high 
priority arbiter that arbitrates on behalf of multiple I/O 

20 devices, as depicted by I/O block 207 in Figure 3. 

An arbitration event is the process by which a new 
symmetric bus owner is determined and changed. On 
every arbitration event, the symmetric agent with the 
highest priority becomes the symmetric owner Note that 

25 the symmetric owner Is not necessarily the overall bus 
owner since a priority agent, having a higher priority than 
any symmetric owner, may grab ownership of the bus 
from a symmetric agent. Once the priority agent arbi- 
trates for the bus, it prevents the symmetric owner from 

30 entering into a new request phase unless the new trans- 
action is part of an ongoing bus locked operation. Bus 
locked operations are atomic operations which cannot 
be interrupted. An example of such a bus operation is a 
read-modify-write cycle. 

35 With reference now to Figure 4, there Is shown a 
conceptual block diagram of one embodiment of the bus 
conversion device 1 5. previously shown in Figures 2 & 
3. Dashed lines 10 and 100 represent the bus signal In* 
terface between converter 15 in both processor 11 and 

40 bus 101 . respectively. Practitioners in the art will under- 
stand that each of interfaces 10 and 100 may include 
data and address latches, voltage translations circuits, 
signal conversion logic, etc., as necessary for translat- 
ing and converting the voltage and logic levels associ- 

^ ated with each of the respective bus Interfaces. 

In the case of bus interface 100, which interfaces to 
pipeline bus 101, included therein is a interface unit in- 
order queue (lOQ) 70. The lOQ 70 is a standard logic 
block associated with the Pentium Pro™ processor bus 

so which keeps track of the current state of bus 1 01 . AH bus 
agents maintain identical tOQ status to track every 
transaction that is issued to the bus. When a transaction 
is issued to the bus, it Is also entered In the lOQ of each 
agent. 

55 The depth of lOQ 70 is the limit of how many in- 
order transactions can be outstanding on the bus simul- 
taneously. Because transactions receive their respons- 
es and data in the same order as they were issued, the 
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transaction at the top of lOQ 70 is the next transaction 
to enter the response and data phases. After completion 
of the response phase, a transaction is rennoved from 
the lOQ. The lOQ is also responsible for sending signals 
such as HIT#/HITM#. DRDY#, DBSY#, as well as other 
bus protocol signals. The default depth of the lOQ is 
eight; however bus agents may configure lOQ 70 to a 
depth of one by activation of pin A7# on RESET#. 

Included within bus conversion device 15 is a bus 
arbitration converter (BAG) 60 that converts the arbiter 
signals associated with the different bus protocols. For 
example, BAG 60 allows processor 11 to gain control or 
ownership over pipelined bus 101 by having its arbiter 
signals properly converted to the bus protocol associat- 
ed with bus 101. 

A detailed block diagram of bus arbiter converter 60 
is illustrated in Figure 5. Included within BAG 60 are 
agent identification (ID) determination state machines 
61 , symmetrical owner determination state machine 62, 
halt assertion logic 63, and bus request logic 64. It is 
appreciated that in the symmetric arbitration scheme of 
system bus 101, each agent requesting access to the 
bus must assert an appropriate request (BREQ#) signal. 
For example, processor 11 may indicate Its intention to 
send a request to bus 101 by driving the BREQ# signal 
line in accordance with the bus protocol of processor 1 1 . 

The signals BREQ, LOGK#, and HLDA are input to 
processor 11 from bus request logic 64. (Note that each 
signal that assumes a logic low state when asserted is 
followed by the pound sign, #. By way of example, the 
LOGK# signal is asserted low when processor 11 Is per- 
forming an atomic transaction.) 

The BREQ signal is the bus request output asserted 
by processor 11 to inform bus request logic 64 that it 
needs to obtain ownership of bus 1 01 in order to perform 
a bus cycle. The HLDA signal is a hold acknowledge 
output asserted by the processor 11 to inform other re- 
questing devices that it no longer owns the bus. Accord- 
ing to the bus protocol associated with processor 1 1 , HL- 
DA is not asserted until all outstanding bus cycles (i.e., 
that have been previously pipelined) have completed. 
The bus request logic 64 converts the bus request signal 
generated by processor 11 into the BREQ#[0] signal 
compatible with the bus request signal associated with 
the protocol of system bus 101 . 

In one embodiment, bus arbiter converter 60 gains 
ownership of bus 101 as a symmetric agent in accord- 
ance with the round-robin arbitration scheme imple- 
mented on the Pentium Pro^" processor bus. In order 
to properly connect to bus 101 , each symmetric agent 
must be assigned a symmetric agent identification (ID). 
This Is the purpose of agent ID determination finite state 
machine 61 . Upon power-on or reset, symmetrical agent 
ID determination state machine 61 receives the BREQ# 
[3:0] signal lines so as to determine the agent ID assign- 
ment of processor card 200. The agent ID information 
is provided to the symmetrical owner determination 
state machine 62. 



To gain ownership of system bus 101, the symmet- 
ric agent BREQ#(0] signal is asserted and the symmet- 
rical owner determination state machine 62 examines 
the state of the bus to figure out when ownership may 
5 be obtained as a symmetric agent In accordance with 
the bus protocol of the Pentium Pro^" processor bus. 
The bus arbitration signals associated with system bus 
101 include BREQ#[3:0], BPRI#, LOCK# (shown in Fig- 
ure 7), and RESET#. The BREQ#[3:0] bus signals are 
10 connected to the symmetric agents including bus con- 
version device 15 of processor card 200 - in a rotating 
manner. This arrangement initializes every symmetric 
agent with a unique Agent ID during power-on or RE- 
SET Bus request signals BREQ#[3:01 provide a mech- 
15 anism by which individual symmetric agents send and 
receive bus ownership requests to arbitrate for owner- 
ship of a system bus 101 . 

The BPRI# signal is a priority request signal where- 
by a symmetric agent receives bus ownership requests 
from a high priority bus agent. For example, BPRI# may 
be asserted by a priority agent so that it can arbitrate for 
immediate ownership of bus 101. The symmetric agents 
coupled to bus 101 receive the BPRI# signal as an in- 
dication that a priority agent Is requesting ownership of 
the bus. 

Upon an arbitration event on bus 101, processor 
card 200 may become a symmetric owner of the bus if 
it currently has the highest priority in the system. A pri- 
ority agent may subsequently request the bus and ob- 
tain ownership over processor card 200. For example, 
I/O devrce 207 (see Figure 3) may obtain ownership of 
the bus from daughtercard 200 by asserting BPRI#, un- 
less, of course, processor card 200 is performing an 
atomic transaction and has asserted the LOGK# signal. 
Processor card 200, of course, may be designated a pri- 
ority agent in an alternative embodiment. 

Hold assertion logic 63 is coupled to receive the 
BPRI# signal from bus 101 and generate HOLD and 
AHOLD signals input to processor 11 . When the BPRI# 
signal is asserted on the bus 1 01 , this means that a high 
priority agent is intervening to obtain ownership of the 
bus. In response to the BPRI# signal, hold assertion log- 
ic 63 informs processor 11 not to Issue any more re- 
quests. 

Note that the AHOLD (address hold) signal causes 
processor 11 to cease driving its address bus, so that 
the processor is unable to control bus cycles. Therefore, 
driving the HOLD and AHOLD signals allows bus con- 
version device 15 to operate with other symmetric 
agents on bus 101 . This forces processor 11 to not own 
the bus by default, whereas, in the bus architecture of 
the Pentium® processor, the processor owns the bus by 
default. In a system configuration with multiple proces- 
sors or agents, use of the HOLD and AHOLD signals 
eliminates the possibility of two or more processors 
backing off each other thereby leading to a "live lock" 
situation. 

Bus request logic 64 also generates the BOFF# sig- 
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nal input to processor 11 to cause processor 11 to im- 
mediately release control of the bus in the event that a 
high priority agent requests ownership of the bus using 
the BPRI# signal. After the BOFF# signal is removed, 
processor 11 restarts the entire bus cycle. 

In an alternative embodiment, processor card 200 
functions as a high priority agent, instead of a symmet- 
rical agent, in the computer system. 

With reference now to Figures 4 & 6, bus conver- 
sion device 15 further comprises outgoing request con- 
verter (ORG) 20 which converts requests generated by 
the processor 1 1 to issue on system bus 101 In an ex- 
emplary embodiment, once ownership of bus 101 has 
been obtained by the Pentium® processor, the outgoing 
request encodings must be translated from the protocol 
of the Pentium® processor to the signaling protocol of 
the Pentium Pro"^** processor bus 101 . Further, the Pen- 
tium Pro"^^ bus implements a two-clock request cycle, 
whereas the Pentium® processor operates with only 
one clock cycle. Therefore. ORG 20 is responsible for 
properly sequencing requests issued by processor 11 
as well as converting to different request type pins from 
the Pentium® to the Pentium Pro^*^ processor bus. 

Outgoing request converter 20 includes request cy- 
cling finite state machine 21, request encoders 22 and 
23, and attribute encoder 26. The request cycling state 
machine 21 receives a signal on line 66 from BAG 60 
indicating that processor 11 has obtained ownership of 
bus 101 . At this point, processor 11 may assert its ADS# 
(address status) output signal to indicate a valid address 
and bus cycle definition are present. 

State machine 21 utilizes this information to gener- 
ate control signals coupled to multiplexer circuitry cir- 
cuits 24 and 25 for the various request and encoding 
changes so that the single cycle processor requests can 
be properly sequenced as two separate clock cycles on 
system bus 101. The encoding for the two request cy- 
cles is performed by blocks 22 and 23. which receive as 
inputs the write/read (W/R#) signal, the memory or I/O 
transaction (M/IO#) signal and the bus cycle definition 
(CAGHE#) signal from processor 11. The CAGHE# sig- 
nal is asserted by the processor during cache write-back 
cycles, and also to notify external memory that the proc- 
essor wishes to perform a cache line-fill using a burst 
bus cycle. 

In addition, request encoder 22 receives the D/G# 
(data/code) signal to indicate the type of read operation 
requested. 

The encoded output signals produced by blocks 22 
and 23 are input to multiplexer 24. The output selected 
from multiplexer 24 is controlled by the request cycling 
state machine 21 , as shown in Figure 6. When an ADS# 
signal is issued onto bus 101 by request cycling state 
machine 21, the various request types from processor 
11 are encoded during the first cycle by encoder 22. At 
this time, address signals provided by processor 11 are 
selected for output onto the system bus address lines, 
A# [35:3], via multiplexer 25. Basic information about 



the request type Is also placed onto the system bus 101 
via the REQ#[4:0] lines during the first cycle. Request 
encoder 22 may be implemented using ordinary combi- 
natorial logic that converts the Pentium® type pins into 
5 appropriate request-type signals on the Pentium Pro^"^ 
processor bus, according to one embodiment of the 
present invention. 

During the second clock cycle, multiplexer 24 se- 
lects the Reqb[4:0] output of encoder 23 for output onto 

10 the system bus 101. Also during the second cycle, the 
address signal lines are utilized to send additional infor- 
mation about the request type to the pipelined bus. This 
information includes various attributes of the request 
type such as the byte enable signal lines (BE#[7:0]): a 

IS split cycle (SCYC#) signal that notifies logic that the 
locked transfer request is twice the number of bus cycles 
expected; the page write-through (PWT) signal that 
specifies whether the L2 cache should use a write-back 
or write-through policy for the line being written to; and 

^0 the GAGHE# signal. This information is selected as the 
output onto the address lines of system bus 1 01 by mul- 
tiplexer 25 during the second cycle. 

In a system configuration which includes a respond- 
ing agent capable of deferring a transaction (i.e., caus- 
es ing it to complete out-of-order) ORG 20 should include 
logic that de-asserts the DEN# signal line with each re- 
quest issued on bus 101 . The DEN# signal is the defer- 
enable signal, and is driven on bus 101 on the second 
clock of the request phase on the EXF1#/Ab4# pin (i.e., 

30 bit 4 of the second cycle address pins). This signaling 
logic, for example, may be included in the ordinary com- 
binatorial logic of state machine 21 . De-asserting DEN# 
with each outgoing request prevents any responding 
agent from deferring a transaction. 

35 Address parity and request parity signals are also 
generated by blocks 27 and 28, respectively, onto bus 
101 as part of an outgoing request. 

Bus conversion device 15 further includes a bus 
lock converter (BLC) circuit 50 that enables the bus to 

40 be locked during atomic transactions; that is, transac- 
tions over several clock cycles that cannot be interrupt- 
ed. Although the lock protocol for the Pentium® proces- 
sor is similar to the lock signaling protocol Implemented 
on the Pentium Pro^"^ processor bus, exactly when a 
lock can be asserted in relation to the respective bus 
protocols varies. 

Figure 7 Illustrates a bus lock converter comprising 
a lock conversion finite state machine 51 that receives 
a LOCK# signal from processor IT as well as an input 

so from ORG 20. In system bus 101 , the LOCK# signal is 
a bi-directional signal bused among all agents. Accord- 
ing to the bus arbitration protocol of the Pentium Pro^*^ 
processor, a current bus owner may assert LOGK# to 
define an indivisible bus locked operation. 

ss Shortly after processor 1 1 asserts its LOGK# signal, 
system bus 101 receives the translated LOGK# signal 
from the lock conversion state machine according to the 
signaling protocol of bus 101. State machine 51 basi- 
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cally implemenls a variable delay -- waiting for signals 
such as ADS#. etc., to be generated by ORG 20 before 
it asserts the corresponding LOCK# signal on bus 101. 
The LOCK# signal on bus 101 lasts through the series 
of atomic transactions and then is deasserted after the 
transactions have been completed. Again, the de-asser- 
tion is performed In accordance with the appropriate sig- 
naling protocol of bus 101 . 

As can be seen in Figure 4, bus conversion device 
15 also includes an incoming request converter (IRC) 
30 for converting signal requests on the system bus into 
signals that can be Input into processor 11. According 
to the signaling protocol of the Pentiunn® processor, an 
external address strobe (EADS#) signal is asserted to 
tell the processor that a valid address is present on its 
local address bus, and that this address can be 
snooped. When the snoop is activated, the processor 
transfers the memory address from the bus to the cache 
directories and a look-up takes place. But because the 
Pentium Pro^*^ processor bus does not Include an 
EADS# signal, or its equivalent, the present invention 
implements the folowing scheme. 

Since pipelined bus 101 is commonly a multi-agent 
system bus, an ADS# signal on the bus indicates a re- 
quest by one of the multiple agents. All agents coupled 
to the bus simply observe these requests, and, depend- 
ing upon the request type, determine whether it is ap- 
propriate to snoop bus 101 , and also v\rtiat to do with the 
information. 

As shown in Figure 8. IRC 30 includes an incoming 
request conversion logic block 31 that receives request 
signals REQ#[4:0] from system bus 101 . Logic block 31 
also receives ADS# signal from bus 101. Incoming re- 
quest conversion logic 31 functions to determine if a re- 
quest on bus 101 is snoopable by processor 11. If the 
current transaction on system bus 101 is snoopable, 
logic block 31 causes EADS# to be driven into the proc- 
essor along with the address information. Note that IRC 
logic block 31 controls tri-state buffers 33 and 34 to as- 
sert/de-assert the EADS# signal and the address pins 
of processor 1 1 . 

Incoming request conversion logic block 31 also 
generates the invalidate (INV) signal to direct processor 
1 1 to either leave the cache line valid, or to mark It invalid 
in the case of a snoop hit. In the event that the cache 
line contains valid data, the line is placed in a Shared 
cache coherency state. Note that logic block 31 should 
make sure that the address hold (AHOLD) signal is as- 
serted so that processor 11 can properly snoop the ad- 
dress bus. This insures that cache consistency is main- 
tained. 

At this point it should be understood that the imple- 
mentation of IRC 31 shown in Figure 8 assumes that 
system bus 101 has been de-plpelined for purposes of 
signal convention. In alternative embodiments of the 
present invention, it may be desirable to not de-plpeline 
bus 101. In such an implementation, bus conversion de- 
vice 1 5 would further Include a queuing mechanism that 



would send all snoops to processor 11 while periodically 
stalling system bus 101. 

Also note that IRC 31 includes snoop control logic 
which prevents 36-blt requests on system bus 101 from 
5 being sent to the processor interface. For example, the 
Pentium® processor is limited to 32-bit requests. There- 
fore, IRC logic block 31 insures that requests which are 
greater than 32-blts are not snooped by processor 11. 
Figure 9 is a detailed diagram of the cache coher- 
10 ency control unit (CCC) 40 shown in the bus conversion 
device 15 of Figure 3. The embodiment of Figure 9 
shows CCC 40 comprising snoop result conversion 
state machine 41 coupled to dirty data buffer 42. which, 
in turn, drives the data bus through tri-state buffers 43. 
15 Once IRC 31 has signaled to state machine 41 that a 
snoop has been driven into processor 11 , a few clocks 
later the processor provides signals to CCC 40 indicat- 
ing that either a cache hit, a cache hit to a dirty line, or 
a cache miss has occurred. This information is signaled 
via signal lines HIT/HITM and ADS. 

Snoop result conversion state machine 41 also 
communicates with the input-order queue (lOQ) asso- 
ciated with bus 101. The lOQ monitors the system bus 
and sends appropriate signals to the state machine and 
tristate buffer 43 so that the data may be transferred to 
bus 1 01 in accordance with the signaling protocol of bus 
101. In one embodiment. lOQ comprises ordinary bus 
state tracking logic. The bus state tracking logic simply 
presents state information at the correct time via system 
bus 101. 

In one embodiment. lOQ comprises a plurality of 
entries wherein each entry translates to a current trans- 
action on the pipelined bus. Each lOQ entry tracks the 
state of the transaction as well. In this manner, all bus 
agents may keep track of each transaction as it flows 
through the pipeline on the bus. For the presently de- 
scribed embodiment, the lOQ has a depth of "1 " since 
the bus is de-pipelined for conversion to processor 11 . 

State machine 41 also communicates some of the 
Information from the snoop event back to the bus track- 
ing logic of system bus 101. in the event of a hit to a 
clean line, or a cache miss, state machine 41 simply re- 
ports this status to system bus 101 at the appropriate 
time. But if a hit occurs to a dirty line (i.e., the processor 
asserts HITM# signal), this situation requires special 
handling since processor 11 is designed to immediately 
output dirty data. 

When an ADS# signal is detected by state machine 

41 following a snoop hit, this means that dirty data will 
be immediately sent out by processor 11 . Since the sig- 
naling protocol of system bus 101 does not allow data 
to be dumped in this manner, dirty data buffer 42 cap- 
tures the data as it is output from processor 11 so that 
it could be sent onto bus 101 at the appropriate time. 
This process is, of course, under the control of snoop 
result conversion state machine 41 . The bus state track- 
ing logic in the lOQ is also coupled to dirty data buffer 

42 in the implementation of Figure 9 since it has the 
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information indicating when dirty data nnay be sent on 
systenn bus 101 to complete the transaction. 

Alternatively, snoop result conversion state ma- 
chine 41 may use the BRDY# signal to throttle the data 
being output by processor 11. In this alternative imple- 
mentation, buffer 42 may be eliminated, and the data 
bus lines from processor 11 may simply by tri-stated by 
tri-stale devices 43 before being coupled to system bus 
101. Note that in this embodiment, BRDY# is used to 
indicate that the currently addressed device has accept- 
ed data from the Pentium® processor in response to a 
write. In other words, snoop result conversion state ma- 
chine 41 operates to allow data to be transferred directly 
to system bus 101 by providing an appropriate hand- 
shake signal. 

To reiterate, logic block 41 takes the snoop result 
from processor 11 and passes it to the lOQ, which is 
incorporated in bus interface 101. In the event that a 
HITM# signal is asserted by processor 11 , snoop result 
conversion state machine 41 waits until the next ADS# 
signal has been output by processor 11 . since that sig- 
nals the beginning of the dirty data dump. 

Figure 10 illustrates the bus request protocol con- 
version logic unit 70, which comprises ordinary state 
machine and combinatorial logic. As shown, logic unit 
70 includes bus protocol conversion finite state machine 
72 and in-order queue 71. State machine 71 performs 
required protocol signal conversion between processor 
11 and system bus 101. In-order queue 71 is utilized to 
keep track of the state of both buses, on opposite sides 
of the interface. lOQ 71 includes ordinary logic and reg- 
ister storage for holding bus state information, as spec- 
ified in accordance with the protocol of system bus 101. 

In addition to tracking the state of the signals shown 
on both sides of the bus interface, logic unit 70 also pro- 
vides necessary handshaking signals, e.g., data trans- 
fer handshaking. (Note that in certain instances, the bus 
signals on opposite sides of the interface have closely 
related functions. For example, the ADS# signal cou- 
pled to processor 11 is a uni-directional address strobe 
signal. The corresponding conversion is to the ADS# 
signal of system bus 101, which is bi-directional in na- 
ture.) 

As can be seen, bus tracking logic unit 70 converts 
various signals transmitted between system bus 101 
and processor 11. Included within this group is the ad- 
dress parity error signal AERR# in accordance with one 
embodiment of the present invention. An agent detect- 
ing a parity error asserts the AERR# signal during the 
error phase of the transaction. All bus agents observe 
the AERR# and on the next clock abort the erroneous 
transaction by removing the transaction from the in-or- 
der queue and canceling all remaining phases associ- 
ated with the transaction. The corresponding address 
parity signal on the processor side of the interface is AP- 
CHK#. In the event of an address parity error, processor 
11 activates APCHK#. 

Also included on the system bus side of the inter- 



face of logic unit 70 are signals HIT#/HITM#. HIT# and 
HITM# are used to Indicate that a line is valid or invalid 
in the snooping agent, whether the line is in the modified 
(dirty) state in the caching agent, or whether the snoop 

5 phase needs to be executed. The HIT# and HITf^# sig- 
nals are used to maintain cache coherency at the sys- 
tem level. As explained previously, when a snooping 
agent asserts HITM# the agent assumes responsibility 
for writing back the modified line during the data phase 

10 (an implicit write-back). The DEFER# signal is also driv- 
en in the snoop phase. The DEFER# is deasserted to 
indicate that the transaction can be guaranteed in-order 
completion. An agent asserting DEFER# ensures prop- 
er removal of the transaction from the ICXD 71 by gen- 

'5 erating the appropriate response. The response signal 
group of a transaction occurs after a snoop phase of the 
same transaction and includes the signals RS#[2:0] 
which provide encodings describing the required snoop 
result. The response agent is the agent responsible for 
20 completing the transaction at the top of the lOQ 71 . For 
write transactions, TRDY# is asserted by the response 
agent to indicate that it is ready to accept write or write- 
back data. 

The data transfer signal group contains signals driv- 
es en in the data phase and includes DBSY#/DRDY# (data 
bus busy and data ready). DRDY# indicates that valid 
data is on the bus and must be latched. The data bus 
owner asserts DRDY# for each clock in which valid data 
is to be transferred. DBSY# is used to hold the bus be- 
30 fore the first DRDY# and between DRDY# assertions 
for a multiple clock data transfer The snoop, response, 
and data phase signals are all tracked via !OQ 71 . 

Bus protocol conversion state machine 72 provides 
outputs to processor 1 1 that include the next address 
3S input (NA#). The NA# input is asserted by a device that 
requests that the next bus cycle be sent over the buses 
prior to the current cycle ending. The extemal right buffer 
empty {EWBE#) signal is provided to processor 11 to 
ensure that memory operations occur in order of execu- 
te tion (i.e., strong memory ordering). This ensures that all 
buffered writes are completed by executing the next in- 
structbn. As previously discussed, the BRDY# signal in- 
dicates that the currently addressed device has present- 
ed valid data on the data bus pins in response to a read, 
45 or that the currently addressed device has accepted da- 
ta from the processor in response to a write. Similarly, 
KEN# is the cache control signal indicating whether a 
location being read is within cacheable address space. 
If an address is not cacheable, the KEN# line is driven 
so inactive telling the processor not to perform a cache line- 
fill. 



Claims 

55 

1 . A processor subsystem having an interface for con- 
nection to a bus of a host computer system, the bus 
operating according to a first signaling protocol. 
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comprises: 

a processor that operates according to a sec- 
ond signaling protocol which differs from the 
first signaling protocol; 

a bus conversion device, coupled to the proc- 
essor and to the interface, that converts the first 
signaling protocol of the bus to the second sig- 
naling protocol of the processor, and vice-ver- 
sa. 

2. The processor subsystem of claim 1 further com- 
prising: 

at least one cache memory coupled to the proces- 
sor. 

3. The processor subsystem of claim 2 wherein the 
processor, bus conversion device, and at least one 
cache memory are contained within a module, the 
module being classified as either a symmetric agent 
or a priority agent according to a distributed arbitra- 
tion scheme of the bus. 

4. The processor subsystem of claim 3 wherein the 
bus conversion device includes arbitration conver- 
sion logic that obtains ownership of the bus on be- 
half of the processor according to a circular priority 
algorithm. 

5. The processor subsystem of claim 4 wherein the 
bus conversion device further comprises: 

an outgoing request converter that converts 
requests from the processor into the first and sec- 
ond request cycles on the bus. 

6. The processor subsystem of claim 5 wherein the 
bus comprises a pipelined bus. 

7. The processor subsystem of claim 6 wherein the 
bus conversion device further comprises: 

an incoming request converter that identifies 
a transaction on the bus that is snoopable by the 
processor, the transaction being input by the incom- 
ing request converter to the processor. 

8. The processor subsystem of claim 7 wherein the 
bus conversion device further comprises: 

a cache coherency control unit that receives 
a signal from the processor indicating either a cache 
hit. a cache hit to a modified line, or a cache miss, 
the cache coherency control unit being responsive 
to the cache hit to the modified line by asserting/ 
deasserting a ready signal that throttles dirty data 
f rom the modified line output by the processor to the 
bus. 

9. The processor subsystem of claim 8 wherein the 
bus conversion device further comprises: 



a cache coherency control unit that receives 
a signal from the processor indicating either a cache 
hit, a cache hit to a modified line, or a cache miss, 
the cache coherency control unit Including a buffer 
s that stores dirty data responsive to the cache hit to 
the modified line output from the processor, the dirty 
data then being sent to the bus by the cache coher- 
ency control unit in accordance with the first signal- 
ing protocol. 

10 

10. The processor subsystem of claim 9 wherein the 
bus conversion device further comprises: 

a bus lock converter that converts a first lock 
signal generated by the processor in conjunction 
IS with an atomic transaction Into a second lock signal 
provided on the bus in accordance with the first sig- 
naling protocol. 

11. A card for Insertion in a slot of a computer system 
20 that includes a host processor coupled to a pipe- 
lined bus that operates according to a first signaling 
protocol, the card comprising: 

a plurality of interface terminals disposed along 
25 at least one edge of the card, the terminals be- 

ing adapted for insertion in the slot so as to pro- 
vide electrical connection to the pipelined bus; 
a plurality of semiconductor devices mounted 
on the card which includes: 

30 

a processor that operates according to a 
second signaling protocol which differs 
from the first signaling protocol of the pipe- 
lined bus; 

35 a bus conversion device that converts the 

first signaling protocol of the pipelined bus 
into the second signaling protocol of the 
processor and visa-versa, the bus conver- 
sion device having a first interface connect- 
40 ed to the terminals which is compatible with 

the first signaling protocol of the pipelined 
bus, and a second interface coupled to the 
processor which is compatible with the 
second signaling protocol of the processor 

4S 

12. The card of claim 11 wherein the first signaling pro- 
tocol comprises an arbitration scheme that includes 
symmetric and priority classifications, wherein the 
card is classified as a symmetric agent on the pipe- 

so lined bus. 

13. The card of claim 11 wherein the bus conversion 
device further includes: 

ss bus request logic that converts bus request sig- 

nals of the processor to bus request signals of 
the pipelined bus in accordance with the first 
signaling protocol; 
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an agent ID stale machine coupled to the bus 
request logic that deternnines an agent ID for 
the bus conversion device in response to a RE- 
SET on the pipelined bus; 

a synnmetric owner state machine coupled to 5 
the agent ID state machine, and bus request 
logic, that monitors a current state of the pipe- 
lined bus and determines when the processor 
has obtained ownership of the pipelined bus; 
hold assertion logic coupled to the symmetric io 
owner state machine that generates a hold sig- 
nal input to the processor, the hold signal re- 
questing the processor to relinquish ownership 
of the pipelined bus. 

15 

14. The computer system of claim 13 wherein the bus 
conversion device further comprises: 

an outgoing request converter that converts re- 
quests from the processor into first and second 20 
request cycles on the pipelined bus; and 
an incoming request converter that identifies a 
transaction on the pipelined bus that is snoop- 
able by the processor, the transaction being in- 
put by the input request converter to the proc- 2S 
essor. 

15. The card of claim 14 wherein the bus conversion 
device further comprises: 

a cache coherency control unit that receives 30 
a signal from the processor indicating a cache hit to 
a modified line, the cache coherency control unit be- 
ing responsive to the signal by throttling dirty data 
from the modified line output by the processor to the 
pipelined bus. 35 

16. The card of claim 15 wherein the bus conversion 
device further comprises a cache coherency control 
unit that receives a signal from the processor indi- 
cating a cache hit to a modified line, the cache co- 40 
herency control unit also including a buffer that 
stores dirty data, the dirty data being output from 

the modified line by the processor being stored in 
the buffer and subsequently sent to the pipelined 
bus by the cache coherency control unit in accord- 45 
ance with the first signaling protocol. 

17. The card of claim 16 wherein the bus conversion 
device further comprises: 

a bus lock converter that converts a first lock so 
signal generated by the processor in conjunction 
with an atomic transaction into a second lock signal 
provided on the system bus in accordance with the 
first signaling protocol. 

55 

18. The computer system of claim 17 wherein the out- 
going request converter comprises: 



a request cycling state machine that generates 
a bus address strobe signal and control signals 
once the processor has gained ownership of 
the pipelined bus and has asserted a processor 
address strobe signal; 

multiplexer circuitry having a plurality of inputs 
and outputs selected by the control signals; 
request encoding circuitry that converts various 
request-type signals from the processor to the 
pipelined bus, the request-type circuitry being 
coupled to the multiplexer circuitry such that a 
single cycle request by the processor is se- 
quenced into a dual cycle request on the pipe- 
lined bus. 
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